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LOGARITHMIC L p BOUNDS FOR MAXIMAL DIRECTIONAL 
SINGULAR INTEGRALS IN THE PLANE 

CIPRIAN DEMETER AND FRANCESCO DI PLINIO 



Abstract. Let K be a Calderon-Zygmund convolution kernel on R. We discuss the 



L p -boundcdncss of the maximal directional singular integral 

Tvf(x) = sup / f(x + tv)K(t) dt 



where V is a finite set of N directions. Logarithmic bounds (for 2 < p < oo) arc 
established for a set V of arbitrary structure. Sharp bounds arc proved for lacunary and 
Vargas sets of directions. The latter include the case of uniformly distributed directions 
and the finite truncations of the Cantor set. 

"^ ■ We make use of both classical harmonic analysis methods and product-BMO based 

time-frequency analysis techniques. As a further application of the latter, we derive an 

_J . L p almost orthogonality principle for Fourier restrictions to cones. 

-i— > 

.a,i 

1. Introduction and main results 

£> ! 1.1. Maximal directional singular integrals. Let m be a Hormander-Mikhlin multi- 

plier on BL For / E C(f(]R 2 ), and v E S 1 , consider the directional multiplier 



(i.i) T v f(x u x 2 )= [ /(euM-^ 1 ^^^. 

rn: Jr2 

O ■ The multiplier T v admits the singular integral representation 

CN' 

T v f{x)= / f(x + tv)K(t)dt, x^supp/, 

Jr 

where K — rh is a Calderon-Zygmund kernel. The most basic example is the directional 
Hilbert transform 

H v f(x)= \ f(x + tv)—, x^supp/, 

Jm l 

which corresponds to the multiplier sign(£). Assume now that we have a vector field 
v : IR 2 — > S* 1 and define the corresponding directional singular integral operator 

T M)( X ) = T v (x)f(x). 

Various cases where the vector field satisfies certain geometric or analytic constraints have 
been recently investigated. In [23] and [20] , Lacey and Li developed a beautiful theory for 
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2 C. DEMETER AND F. DI PLINIO 

C 1+t vector fields. Their analysis revealed intricate connections between the directional 
Hilbert transform, Carleson's theorem on the convergence of Fourier series and a certain 
restricted version of the Kakeya maximal function. 

The Lacey-Li techniques have been recently refined by Bateman and Thiele. They 
combined novel density estimates for rectangles with a new approach to proving vector 
valued inequalities, to obtain IP estimates for H v in the case when the vector field depends 
on just one coordinate [2]. On the other hand, Stein and Street [30] have obtained 
analogous results for the case of real analytic vector fields. 

Continuing the initial investigation from [6], in this paper we analyze another case of 
interest, the one where the range of the vector field is finite. More precisely, we are 
concerned with L p bounds for the maximal directional multiplier 

(1.2) T v f(x) = sup \T v f(x) I , 

where V C S l is a set of N directions. Due to the special role played by the maximal 
directional Hilbert transform, we will reserve for it the notation H\-. 

It has been long recognized that there is a close connection between maximal and 
singular integral operators in harmonic analysis, and indeed, in most classical cases the 
two types of operators have similar boundedness properties. In many ways this paper will 
argue against the generality of this principle. We start by briefly recalling some history. 
The relevant directional maximal function 

(1.3) M v /(x) = sup|M,/(x)| M v f(x)= sup ^- f f(x + tv)dt 

uGV £>0 ■*£ J-e 

has been intensely studied and is now completely understood. The sharp bounds (for 
generic V) 



(1.4) ||M v || 2 ^2<logiV, ||M v || 2 ^ 2 ,oo< v/logiV, ||M v |U P <(logiV)p, p > 2, 

were first proved in [17]. The more recent work [1] shows that bounds become independent 
of TV 

ll-Mv||p->p $1, 1 <p < oo, 

if and only if V is lacunary of finite order. 

In striking contrast to My, the sharp (in terms of N) upper bounds for the operator 
norms ||i7v||p-»p are only known for 1 < p < 2. Taking / to be the indicator function of 
the unit ball and V uniformly distributed, one gets the lower bounds (see [6]) 

\\Hvf\\ P > N*\\f\\ p , Kp<2, \\H v f\\ 2 > logiV||/|| 2 . 

These lower bounds are in fact also upper bounds for generic V. The case p = 2 is a 
simple consequence of the Rademacher-Menshov theorem. Indeed, \Hyf(x)\ is bounded 
by the sum of two operators of the form 



V 



1.5) max > fi(x) 

1 ' Ku<N\ £-^ J3y I 
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where fj are Fourier restrictions of / to disjoint frequency cones rooted at the origin. On 
the other hand, for each TV there exists an orthonormal set {/«• : j — 1, . . . , N} with 



j;Gl 2 : max | \, fj ( x ) 



Ku<N 



> VNlozN 



> 1 



2 



(this is Menshov's counterexample, see [16]); thus one cannot get a weak L bound for 
Hy which is better than the strong L 2 bound by simply invoking Rademacher-Menshov 
type results. 

In the general case, the sharp bound 

||Tv/|| 2 <logiV||/|| 2 

has been established in [6]. Quite surprisingly, the article [15] proves the lower bound 

(1.6) ||i/v||2-2 l0 o > \/iogiV, VVc5 1 ,#V = JV. 

Thus Hy is an unbounded operator on L 2 as soon as V contains infinitely many directions, 
in particular for infinite lacunary sets. This is in sharp contrast with the directional 
maximal function. 

Our main goal is to investigate the sharp dependence on TV of the LP (2 < p < oo) and 
also weak L 2 operator norms of Ty. In analogy with the case of the maximal function 
(1.4), one expects logarithmic-type bounds for all 2 < p < oo. However, at least three 
structural differences between Ty and My make the former harder to treat. The maximal 
directional singular integral is not a positive operator; it is a sum not a maximum of single 
scale operators. Moreover, unlike in the case of My (which is trivially bounded on L°° 
for any V), there is no immediate evidence of an endpoint result for Ty. 

We briefly mention some of the preexisting literature on maximal directional singular 
integrals. In [4], the authors deal with the boundedness of {H v f(x) : x G M n , v G S 1 ™ -1 } 
and of the relative maximal truncations in the mixed norm spaces L p (IR n ; L 9 (S' n ~ 1 )), 
1 < p,q < oo. In the article [9], it is proven that, whenever V C S 1 is a set with 
Minkowski dimension d(V) < 1, My and Hy are bounded operators on the subspace 
of the radial functions in L P (M. 2 ) for p > 1 + d(V), and unbounded for p < 1 + d(V). 
The more recent article [19] reproves that ||-f^v||2->2 ~ logiV when #V = N without 
appealing to Rademacher-Menshov explicitly, and by the same method obtains the sharp 
L 2 bound (which is C^iV" 1 )) for the analogous maximal directional Hilbert transform 
in M. n , n > 2. Our results partially answer both open problems mentioned in [19]. 

1.2. Notation. Throughout the paper, m stands for a Hormander-Mikhlin multiplier on 

E, that is 

m G C°°(M\{0}), \d a m(0\ <« ICI"*, a > 0; 

with corresponding multiplier operator 

T : L 2 (R) -> L 2 (K), Tf(x) = [ f(ZM0e**d£. 

We will also make use of the maximal truncation of T, and adopt the notation 

T*/(x)=sup|T e /(x)|, T £ f(x)= [ f(x-t)K(t)dt, K = rh. 

e>0 J\t\> £ 
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The values of the positive constants C, c, as well as the implicit constants hidden in the 
notation < may vary from line to line, and, unless otherwise specified, are allowed to 
depend only on the multiplier m. 

The Hardy-Littlewood maximal function of / on M. n [n = 1, 2) is denoted by Mf. For 
R C M. 1 or IR 2 , we set K R f = jm f R f, and, for a collection 11 of rectangles in the plane, 
we denote the corresponding maximal function by 



M n f(x) = sup \E R f\l R (x), x e 



)2 



1.3. The main results. The first main result concerns arbitrary finite sets of directions. 
We believe that the exponent one of the term log N gets sharp in the limiting case p — > oo, 
see Conjecture 1.3. 

Theorem 1. Let m be a Hormander-Mihlin multiplier and let be TV defined as in (1.2). 
For any given set V of N directions 

(1.7) \\Tv\\p-> P < CplogN, 2<p<oo. 

Moreover the following endpoint result holds: if supp / C Q C R 2 

(1.8) \{xeQ:\T v f(x)\>\\ogN}\<Cexp(-Cj±-)\Q\, VA > 0. 

Remark 1.1. A simpler proof of this result for Ty = Hy is given in paragraph 6.1.3, 
exploiting the almost //-orthogonality principle in Theorem 4. 

Definition 1.2. An ordered set V = {i>i,V2> • • •} C S 1 (finite or countably infinite) is 
called lacunary with node Voo G S 1 if 

(1.9) \v j+ i - V^ < -\Vj - V^, J = 1,2, .... 

Definition 1.3. [18] A finite set VcS 1 with #V = N is a Vargas set with constant Q 
if 

(1.10) max{^V' : V' is a lacunary subset of V} = QlogN. 

The Vargas sets with constants Q = 0(1) independent of N will simply be referred to 
as Vargas sets. Examples include uniformly distributed sets of directions Vjy = \Vj = 

e 2m N^j = 0, ...,N — l} which are easily seen to be Vargas sets with Q — 1. The N- 
truncations of Cantor sets are also Vargas sets with Q = 4. See Subsection 7.1 for more 
details. 

Finite lacunary sets and Vargas sets are at the opposite ends of the spectrum in terms 
of maximum cardinality of lacunary subsets. In fact, every set of N directions is "at least 
| Vargas" , in the sense that it contains a lacunary subsets with -^|— elements (a proof is 
given in Subsection 7.1). In these two extreme (but rather relevant) cases, we are able to 
improve the result of Theorem 1 and obtain essentially sharp bounds. 

Theorem 2. Let V be a lacunary set of N directions. We have the sharp bound 

(1.11) \\T v \\ p ^ p < pV /logN, Kjxoo. 
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Theorem 3. Let V , #V = N, be a Vargas set with constant Q. We have the following 
bound 



(1-12) ||#v||2-2,oo ^Q V^sN^oglogN) 6 . 

In particular, if V is uniform or a Cantor set then the following essentially sharp bound 

holds 

||#v|| 2 ^2,oc < Vhg~N(\oglogNf. 

Remark 1.4. The (essential) sharpness of Theorem 3 follows immediately via comparison 
with (1.6). It is not clear whether the log log N term can be eliminated. 
Note also that the bound in Theorem 3 looses its strength as Q gets large. 

Interestingly, Theorem 2 and Theorem 3 show that uniformly distributed sets and 
lacunary sets of N directions have the same quantitative behavior on weak L 2 (this is not 
the case for strong L 2 ). 

The methods of proof of Theorem 3 can also be easily modified to prove the estimate 

\\Hvf\\ p < P (logiV)^(loglogiV)^, 2 <p < oo, 

for each Vargas set of cardinality N. We strongly believe these methods are sharp, and 
thus that the exponent of logiV is the correct one for each p > 2. We are thus led to the 
following conjecture, which, as our results show, holds for lacunary sets and (at least in 
the case of the Hilbert transform) Vargas sets. 

Conjecture. Let Tv be defined as in (1.2) and #V = N. We have the bounds 

||Tv|| 2 ^2,oc < Vlog^OoglogAO , ||T v || p ^< p (logAO^(loglogiV)^, 2 < p < oo. 

In light of our Theorem 1, the second conjectured inequality will follow if the first 
inequality is established. 

1.4. The methods and the plan of the paper. Theorem 1 and 2 are proved by 
classical harmonic analysis methods. The main tool behind the proof of Theorem 1, which 
is given in Section 2, is an exponential- type good-lambda inequality for one-dimensional 
singular integrals by Hunt [13], which allows to estimate the contribution of the N singular 
integrals T v with a loss which is only logarithmic. 

Theorem 2 is proved in Section 3. Via an application of the exponential square integra- 
bility inequality by Chang, Wilson, and Wolff [3], we lose a -y/logiV and reduce T v to a 
square function in the frequency Littlewood-Paley annuli, which is shown to be uniformly 
bounded in N when V is lacunary. 

On the other hand, Theorem 3 is proved by means of time-frequency analysis in the 
phase plane. We use a dicretization for H^ very similar to the one introduced by Lacey 
and Li in their pioneering work [23] and [20]. The resulting model sum closely resembles 
the one introduced in [22] for Carleson's operator (see Sections 5 and 6). However, the 
tiles appearing in the model sum for Hv are a bi-parameter family, and are allowed to 
be oriented along each direction in V. These model sums are chopped into single tree 
operators. Our analysis is somewhat similar to the one from [8] which contains a new 
proof of the boundedness of the Carleson operator. The approach from [8] is to show that 
for each point x, the contribution to the Carleson operator comes essentially from one 
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tree. For Vargas sets, there will be at most logiV trees contributing to H-y. This upper 
bound is further lowered to \/Iog N by interpolating with a Bessel inequality. 

One of the things that distinguishes our approach from the ones in [2], [6], [23] and [20] 
is the fact that our trees will be of product nature (i.e. bi-parameter). The bulk of the 
argument is given in Section 7. 

2. Proof of Theorem 1 
Throughout this section, we use the notations 

»Q{A) = J 1a(x) ^, \\f\\ LHQ) := (| |/(a;)|' d/*o(o;))' = (/ \f( x )\ P J^) P 

for Act 2 and 1 < p < oo. Before we enter the proof, we recall two results we cited in 
the introduction, which will be used below: the sharp bound 

(2.1) ||T v /|| 2 ^ 2 <logiV 
which has been proved in [6], and 

(2.2) ||M V /|U P < C(log JV)p||/|| p , 2 < p < oo 
which has first been established in [17]. 

2.1. Maximal truncations of CZ kernels and a good-A inequality. To prove The- 
orem 1, we need two propositions. The first is a consequence of the sharp weighted bound 
for maximal truncations of singular integrals in terms of the weight characteristic. The 
second is a reformulation of a classical good-lambda inequality by Hunt [13]. 

Proposition 2.1. We have the bound ||T*/|| P < Cp||/|| p , for each 2 < p < oo. 

Proposition 2.2. There exist positive absolute constants c ,Ci such that for all f G 
Cg°(R), all A > and all 7 > 4, 

(2.3) |{i6l: \Tf(x)\ > 7A, Mf(x) < c A}| < Ce- Cl7 |{x G R : \TJ(x)\ > A}|. 

Proof of Proposition 2.1. As in [10], we follow Rubio de Francia's method. The recent 
article [14] contains the sharp bound 

(2.4) \\TJ\W {w) <C[w} A2 \\f\\ L 2 {w) , 

where [w]a 2 denotes the A 2 characteristic of the weight w. Let q = (p/2)' > 1. Choose 
v G L q (M.), such that \\v\\ q = 1 and 

||T*/||;= I \TJ{x)\ 2 v{x)dx. 

JR 

By interpolating the trivial L°° bound with the endpoint ||M/||i < C||/||x,io g L, one obtains 
the (sharp) bound ||M|| g ^ g < Cp. Therefore, defining 

00 1 
w = X^ — - — . —M o---oMv 



L p BOUNDS FOR MAXIMAL DIRECTIONAL SINGULAR INTEGRALS 

we have 

v(x) < w(x) a.e., \\w\\ q < 2, Mw(x) < Cpw(x) a.e., [w]a 2 < Cp. 
We can conclude the proof with the chain of inequalities 

\\TJ\\ 2 p = I ' \TJ{x)\ 2 v{x)dx< I \TJ(x)\ 2 w(x)dx 

J R J R 



<C z [w]a 2 ] \f(x)\ z w(x)dx 

<cy\\ni 

where we used (2.4) in going from the first to the second line. □ 

Proof of Proposition 2.2. Let us call G the set on the right-hand side of (2.3). We can 
write G as the countable union of intervals J with disjoint interiors such that 3J (jL G. 
Therefore, (2.3) will follow by summing up the estimate 

(2.5) \{xeJ: \Tf(x)\ > j\,Mf{x) < c X}\ < Ce^\J\. 

Let us prove (2.5). Choose a point x G 3 J fl G c and let J be the interval of length 6| J\ 
centered at x. Set 

/ = /i + h, h — /I j) fi — /l(j) c - 
It is clear that 

(2.6) (2.5) LHS < \{x e J : iTf^x)] > |A,M/(x) < c \}\ + \{x G J : \Tf 2 (x)\ > \ A}| 
Rescaling Proposition 2 of [13] on J, we get that 

(2.7) |{xG J: |T/i(x)| > l\Mf(x) < c X}\ < Ae~^\ J\ < 6Ae~^ 7 |J|. 

for some positive absolute constants A, clq. If we prove that the set in the second term of 
(2.6) RHS is empty for some choice of cq, (2.5) will follow from (2.7), with C = 6A, c\ = f^-. 
This is done as follows. It is known that (see [29], p. 208, eq. (35)) 

(2.8) \Tf 2 (x)-Tf 2 (x)\ < A.iniMf < A iCq X, Vx G J, 

for some absolute constant A\ > 0. Here the second inequality follows because we can 
work under the assumption that the set in (2.5) LHS is nonempty. Also note that 

Tf 2 (x) = T e f(x), e = 6\J\ 

and by our choice of x <£ G, |T e /(a?)| < T+f(x) < X. Hence, if we set c = 4-, we get 

\Tf 2 (x)\ < \Tf 2 (x)-Tf 2 (x)\+TJ(x) <2X< |a, Vx G J, 
and this completes the proof. □ 
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2.2. Proof of Theorem 1. By a limiting argument, we can assume that / is supported 
on a square Q C M. 2 of (finite) sidelength £(Q). We first treat the part of Tyf outside 2Q. 
Let x G 2 k+1 Q\2 k Q, k > 1. We have, using the decay properties of the kernel K, 

r 8 2 (x) ^ r2 k £(Q) 

(2.9) \T v f(x)\< / \f( x+ tv)\\K(t)\dt<—— / \f(x+tv)\dt<CM v \f\(x). 

JS 1 (x) L HVj J~2 k l{Q) 

Here we assumed that the line {x + tv : t G 1R} intersects Q at all £ between S\ (x) and 
5 2 (x), with Si (x) , #2 (x) ~ 2 k £(Q) (otherwise (2.9) holds trivially). Therefore 

(2.10) ||(Tv/)1 k2 \2q|| p < C||M v |/||| p < C(log7V)l||/|| p . 
We now show that 

(2.H) \\Tvfhn2Q) < CphgN\\f\\ LP(Q) , V2 < p < oo; 

this and (2.10) immediately imply (1.7). The weak-type endpoint result (1.8) is then 
obtained as an easy consequence of (2.11) (proceeding as in, for instance, [29, Sect. 
IV.1.3]). 

In short, to prove (2.11), we apply the exponential good-lambda inequality of Proposi- 
tion 2.2 for each T v , which acts separately on each fiber of / along v, to excise N different 
exceptional sets. Due to the exponential decay in the good-lambda inequality, the final 
loss is only logarithmic. 

Proof of (2.11). The range of exponents is open, so by Marcienkiewicz interpolation it 
suffices to prove the weak-type bound 

(2.12) p 2Q ({x G 2Q : T v f(x) > AlogiV}) < ^||/||^ (Q) , A > 0, 2 < p < oo. 
For A > 0, split (cq below is the absolute constant appearing in Proposition 2.2) 

/ = /l + /2, fl(x) = f(x)l{\f\< CoX }, /2 = /-/l, 

We get at once that 

(2.13) bu P |/i(x)|<coA, 

x&Q 

(2.14) HMI^q) < ||/|| LP(Q) , 
(2-15) ||/ 2 ||i 2(Q) < C\ 2 ^\\f\\ p LP{Q) , 

(2.16) /i 2Q ({T v / > 2AlogiV}) < // 2Q ({T v /i > AlogTV}) + // 2Q ({T v / 2 > AlogiV}) 

We estimate the second term in the right-hand side of (2.16); using the L 2 bound of (2.1), 
and (2.15), 

(2.17) M{T v / 2 > AlogiV}) < ^||/ 2 ||! 2(Q) < ^||/||£ w 

We turn to the first term in the right-hand side of (2.16). Preliminarily observe that 
/" 2 q({T v /i > AlogiV}) < ^ 2Q ({T v /i > A log N,M v f < c A}) + // 2Q ({M V / > c A}), 
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and (2.13) implies that the second set on the right hand side is empty. Thus, in view of 
(2.17), (2.12) follows from the estimate 

C p v p 
(2.18) ft 2Q ({x G R 2 : T w g{x) > X\ogN,M v g(x) < c A}) < -^-||^||^ (Q) , 

applied to g = f\. We have completed the proof of (2.12) (and thus, of Theorem 1), up 
to showing (2.18), which we do at the end of the section. □ 

Proof of (2.18). We claim that for each v G S* 1 

(On) p \\n\\ p 
,\yP) \\y\\LP(Q) 



•ci7 



(2.19) fi 2Q ({x G R 2 : \T v (g)(x)\ > j\,M v g(x) < c A}) < Ce 

To prove the claim, assume without loss of generality v = (1,0). Denote by g X2 {') the 
function x\ h-> g{x\^x%). Then 

l^2 Q {{x G R 2 : \T v (g)(x)\ > j\,M v g(x) < c A}) 



^ 717T\ / l{ Xl G M : l T (fe)0£i)l > l\M v g X2 {x x ) < c A}|dx 2 

4 lv| Jm. 

L-J Ce- c ^\{ Xl G R : \T^g X2 )( Xl )\ > \}\dx 2 



~MQ\ 

where the third to last inequality is an application of Proposition 2.2, and last line follows 
from the bound of Proposition 2.1 for T* on L p (xi). Now, using (2.19) for each T v , we 
have 

fi 2Q ({x G R 2 : T v g{x) > (ci^Alog N,M v f{x) < c A}) 
< J2^Q(i x e R2 ■ \ T v(9)(x)\ > (ci) _1 A log N,M v g(x) < c A}) 

which completes the proof of (2.18). D 

3. Lacunary V: Proof of Theorem 2 
We need to prove that for a set V of N directions which is lacunary (see Definition 1.2) 



\\T v f\\ p <C p ^/hg~N\\f\\ p , Kjxoo 

In the related paper [6], the L 2 -estimate of the maximal multiplier (1.2) is reduced (with 
a i/logiV loss) to the L 2 bound for the corresponding square function, via an application 
of the Chang- Wilson- Wolff inequality [3]. See also [12], where the same method had been 
first applied to the study of maximal one-dimensional multipliers, and [7] . We apply the 
same reduction, and then show that the resulting square function is bounded on all L p , 
1 < p < oo uniformly in the cardinality of V (see (3.3)). 
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3.1. Reduction to a square function estimate. Let $ be a Schwartz function sup- 
ported on [1/2, 2] and such that 



J>(2- fe £) = l, V£GM\{0}. 



fcez 



For a function / G C^ 
(3.1) 



set S k f(0 = /(0 $ (2~ fe |f I), and note that 



Ei^/i : 



fcez 



1 < p < oo. 



We summarize the reduction to a square function in the following proposition; the proof 
is contained in the reference [12], see also [6]. 

Proposition 3.1. Let Pi, i = 1, . . . , N be linear (uniformly) bounded multiplier operators 

on R 2 . Denote by Mp = Mp o M p o Mp, where Mpf := (M[f p ])V . Then, for each 
1 < p < oo and each e > we have 



(3.2) 



sup \Pif\ 

8=1,. ..JV 



< 



: , P (logA0* 



E(^ 



l l+e 



fcez 



sup \Pi(S k f)\ 

i=l,...,N 



2\ 2 



By means of Proposition 3.1, and a subsequent use of the Feffer man- Stein inequality (with 
1 + e < p) we get 



E^WM : 



fcez 



< 



E 

feez 



»fc 



followed by (3.1), we reduce Theorem 2 to the proof of the estimate 



(3.3) 



£l T vto/)f 



fcez 



< 



P- 



1 < p < OO. 



Let us recall once again that the easy case p = 2 of (3.3) was proved in [6]. 

3.2. Proof of the square function estimate (3.3). We can harmlessly assume that 
the multiplier m is supported on [0, oo) and that / is supported in the first quadrant. We 
take the node of V to be Voo = (1,0) and order V = {vj = e 2nie ^ Vj '} counterclockwise. 

Since (!;«,) = 0, (1.9) reads |0(u i+1 )| < 2 -1 |0(t/ ; -)|, J > L By eventually splitting V 
into the three subcollections {vj G V : j mod 3 = i}, i G {0, 1,2}, we can work under 
the stronger separation property 



(3.4) 



|0(^- + i)|<2- 3 |0O 



3J\i 



J>1- 



We denote 5 = \9(vx)\. It is clear that for each interval I t := (2" 3 ^ +1 )5, 2~ 3£ 5,}, £ > 1, 
there is at most one j(£) such that 9(vj^) G It. Whenever I e fl {9(v) : v G V} = for 
(at least two consecutive) £,£+ 1, ...,£ + m, we add to V the m directions with angles 
6 G {— 2~ 3i , . . . , _2~ 3 (^+ m_1 )}. We continue to call V the resulting collection and re-index 
the Vj so that they are ordered counterclockwise. By doing so, we have obtained that 

(3.5) 2-^ + ^5 < 5, := \0(v j+1 ) - 6{vj)\ < 2^5, j > 1. 
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It is handy to define Wj = 0(vj) + j, since the assumption on the support of the multiplier 
m implies that for £ = |£|e 27 ™ e ^ in the first quadrant 

(3.6) m(e-^)^0 =► 9(£)e{0,Wj). 
Accordingly, define the intervals 

a = (0,iwi], aj = (wj,w j+ i], j = 1,...,#V- 1, « #v = (w#v,^oo]- 
For an interval / = (an, a r ) C [0, 1], define the frequency cutoff to the cone [e 2nie : 9 G 1} 

We simply write Gj for G a . The bulk of the proof is contained in the next two results. 
In the first lemma, which is proven in Subsection 3.3, a pointwise bound on Ty(Skf) is 
given, in terms (essentially) of the maximal function along the direction v^,. 

Lemma 3.2. Define 

(3.7) / ev = y j G-jjfi /odd = / ^G 2 j+if, 

j j 

and adopt the notation (here v 1 - is the direction orthogonal to v) 



M*f(x) = sup — 

t, 8 >0 'US 



t PS 



I f(x + tv + av L ) | drdcr, x E 



for the bi-parameter maximal function in the coordinates of v. Then 

(3.8) T v (S k f)(x) < M*l [S k f ev ] (x) + M£ [S k f odd ] (x), xe R 2 , keZ. 

The second lemma deals with the boundedness of the rough cutoff multiplier V . SjGj. 
The proof is given in Subsection 3.4. 

Lemma 3.3. We have the estimate 

(3.9) sup |J>A/ <||/|| p , Kp<oo. 



Conclusion of the proof of (3.3). Lemma 3.3 immediately implies the bounds 

(3-10) H/evllp, ||/odd||p<||/||p, Kp<oo. 

We conclude, using the Feffer man- Stein inequality on the vector valued M^ 1 , that 



< 



< 



< 



kez 



-,ez 
fcez 

52\Skf, 



2\ 2 



p 












rf 


+ 

V 


[J2[ M t[s k fodd] 

kez 


r 


+ 


( 


El 

fcez 


Skf odd] J 


V 





ll/evllp+H/, 



odd ||p r~^ 



This completes the proof of (3.3), and in turn, of Theorem 2. 



□ 
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3.3. Proof of Lemma 3.2. The estimate (3.8) is an easy consequence of 

(3.11) \T v .[S k f ev ](x)\ < Af£ [S k U]{x), Wx G M 2 , Vj G V, 

and of the analogue estimate for Sf-f odd- Observe that T V2 . [Skf&] = 7 1 V2 ._ 1 [jS'fc/ ev ], since 

/ ev is zero on the cone bordered by u>2j_i and w 2 j. Thus it suffices to consider odd j. We 
now show (3.11). Let j be fixed throughout. We will omit some dependence on j in the 
notation. Denote by 

(onm) ■= {twj : t > 0} n {|e| = 2 fc }, (Ci,C 2 ) := {^ J+ i : t > 0} n {|e| = 2 fc+1 }, 

As a consequence of (3.4), we have that 

< Ci = 2 k+1 cos(w j+1 ) < 2 fc+1 |^ i+1 )| < 2 k ~ 2 \6(v j )\ < 2 k cos(wj) = m ~ 2 fe 58~ j 

Now, let ^j be a positive Schwartz function on R which is equal to 1 on [2 fc 8 _J_1 5, 2 fc ] and 
vanishes outside of [2 k 8~i~ 2 8, 2 k+1 ]. The idea is that we can insert the vertical frequency 
cutoff given by ipj(£i) harmlessly, because (see (3.6)) 

\SM(£M£ ■ Vj ) + => & > *7i => ^(6) = 1, 

and therefore T^. [Sjt/ev] (£) = S k f ev (£)nj(£), where we set /^(£) = $(2~ k \£\)i>j(€i)m{£-Vj). 
The multiplier /Xj is easily seen to satisfy 

for some constant k > 3. Thus Kj = fij obeys the bound 

\K j (x u x 2 )\ < 2 2k 58- j \l + 2 k 58- j \x l \ + 2 k \x 2 \\~ 3 
This implies 

\T Vj [S k f ev ]\(x 1 ,x 2 ) < \S k f ev * K j \(x 1 ,x 2 ) < M* go [S k fa l ](xi,x 2 ), 
as claimed in (3.11). The proof of Lemma 3.2 is complete. 

3.4. Proof of Lemma 3.3. We need a smooth analogue of Gj. Let /3 be a positive 
Schwartz function with support in [— |, |] and such that YeezPit — i) = 1r(£); define the 
multiplier operator 

Gf?(0 := /(0/3 (^|7p) , e = \Z\e 2 * im € M 2 . 

Let us cover each otj by 128 intervals of equal length a^a = (wj + (£ — 1) y^g, Wj + Pj^] , 
and denote G s a . £ simply by G s j £ . By the definition of /?, we have the identity 

128 

(3.12) G y / = G s (G}f), G)f := J^G^/. 

£=i 

The frequency support of G* e is contained in 2a jj (the interval with same center as a^ 
and twice the length). By (3.5), 8j and <5j +1 are within a factor of 16, thus 

128 

LEW*) < 32. 
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The above observations and the smoothness of (3 imply that G = J2j e £ j,^ S j,i * s a (two- 
dimensional) Hormander-Mihlin multiplier, and therefore 



(3.13) 



sup 22 £ j,zG S j t eh 



E,=0,±1 



j-t 



< 



P- 



1 < p < oo; 



see [28, Theorem IV. 3]. By randomization, we get the square function inequality 



(3.14) 



Eigwoi 5 



< 



I'- 



1 < p < OO. 



We will need the vector-valued inequality of the Proposition below, which is a variant of 
the results of [5] and its refinement [11, Theorem VI. 8.1]. The full range 1 < p < oo is 
obtained by combining the method used in [11] with the sharp weighted bound for the 
Hilbert transform in terms of the A 2 characteristic of the weight [26]. 

Proposition 3.4. For each 1 < p < oo, we have that 



(3.15) 



El 6 *] 



< 



En 



Recall the identity (3.12). Applying (3.15) for hj = G s ,g, and subsequently using (3.14), 
we get that for each g G L 2 fl LP 



(Ei^if 

j 


p 




I" 


< 

p 


(Ei«;»if 




< 


(Ei G ^if 

3,k 


1 

V 


S \\g\ 


p 



When p = 2, the first and the last quantities in the above display coincide, so that 
polarization actually gives the equivalence 



(3.16) 



'-'P \\9 \\p — 



El°: 



9\ 



< C p \\g\ 



p- 



1 < p < oo. 



Applying the left inequality in (3.16) to g — V ■ e fijJ then yields 



|0|Ip~ 



Eig, 



9\ 



< 



Eig./p 



< 



p- 



1 < p < oo. 



where the last step follows from the right inequality in (3.16). The last display is exactly 
the assertion we had to prove. 



4. Grids, adaptedness, and a product John-Nirenberg inequality 

In this section, we define product size and establish a version of the (product) John- 
Nirenberg inequality, Proposition 4.1. Most of these results are well known from previous 
literature. Let us start with some definitions. 
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4.1. Grids. A one-dimensional grid Q is a collection of intervals with the property that 

(4.1) 1,1' eg => I n I' e {0, I, 7'}. 

For z g {0, 1, 2}, 7> = {2 j {£ + ^^ + [0, 1) : j, £ G Z) } are three (dyadic) grids with the 
property that for each interval J C R there exists some J G DUP 1 Ul' 2 with J G I G 9 J 1 . 
For simplicity, we write 7J> for 7J>°; for a scale scl G 2 Z , r>* d = {I E T> 1 : \I\ = scl}. 
A grid Q is said to be A'- sparse if for every two 7, J G Q 

(4.2) |7| < |7'| => \I\ < 2 K \I'\, \I\ = \I'\ => dist(J, I') > 2 K \I'\. 

Right from the definition, if Q is (K + 10)-sparse and 7, I' G £, with |7| = \I'\, the dilates 
2 K+4 I and 2 A+4 7' do not intersect. On the other hand, 1,1' eg, I Gl' =► 2 x+4 7 G 21'. 
Using this property, it follows that if Q is (K + 10)-sparse, 

(4.3) V7 G Q 3.7(1) G I? U P 1 U V 2 with 2 A 7 C J(7) C 9 ■ 2 K I, 

and |7| = |7'| =► J(7) n 7(7') = 0, 7 C 7' =► J(7) C J(7'). Now, observe that 
every dyadic grid g can be written as the disjoint union of K2 K 7\~-sparse dyadic grids. 
It follows that 2 K g := {2^7 : 7 G g} can be written as the union of 3 • K2 K subsets, each 
of which fits into a dyadic grid as described in (4.3). 

4.2. L 2 -adaptedness. Let 7? = Ir x Jr be a rectangle in R 2 with orientation (e^, e^). 
We denote by c(R) its center and by R^ the rectangle 2 fc 7?. Let K be a large positive 
constant. We say that a Schwartz function ipR is L 2 - adapted to the rectangle R if for all 
a, (3 > there exists a constant C a) ^ such that, in the coordinates (e#, e^), 



C a ,p A |xi-c(i?)x| |x 3 -c(i2); 



J/; 



-K 



(4.4) |^O^i.^)l < , r|a+1 .M, +1 ( 1 + in + 

Let 72 be a collection of rectangles (with possibly different orientations). We say that 
{iPr}r£K, are L 2 -adapted to 72 if each ipR is L 2 -adapted with same the constants C a ,p, K. 
If {iPr}r£k are L 2 -adapted and for each 7? we have L 2 4>R — 0, we say that {^>r}r^ti 
are wave packets adapted to 72. If the stronger condition 

(4.5) f *!>r= [ ^R = 0, VxGR 2 

J {x+te R :teR} J {x+te^-.t&M.} 

holds, we say that {iPr\r^-ji are product wave packets adapted to 72. 

4.3. A product John-Nirenberg inequality. Let 72 be a collection of rectangles 7? = 
Ir x Jr with fixed orientation. Assume that X-n := {Ir : 7? G 72.} and Jn := {Jr : 7? G 72} 
are each (respectively) subsets of a dyadic grid X (resp. ,7). We will use the notation 

sh(72) = [j R 
Ren 



By CJ we mean the interval with same center as J and C times longer. If R = I x J is a rectangle, 
CR = CIx CJ. 
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for the shadow of such a collection. Given a collection of complex coefficients B = {&r}_r£7£ 
and functions {iI>r}r£1i, define 

(4.6) B^(x) = J2 b nMx), K' C K. 

R£TZ' 

Let us introduce 

i 

(47) 8ize(B) % 8 s(wW£ |6R|2 )' 

In the following proposition, we show how size is akin to the product BMO norm. 
Proposition 4.1. Let {iPrJr^k be product wave packets adapted to TZ, and with 

(4.8) supp tp R C R. 
We have the estimates 

(4.9) \\BkWp < Cp 2 size(B)\sh(K)\p , 2<p<oo, 

(4.10) \{x E sh(K) : \B n (x)\ > A} | < C7exp (- Cv Q^)|sh(^)|. 

Subsections 4.4 and 4.5 are devoted to the proofs (respectively) of Proposition 4.1 and 
of the related product Chang- Wilson Wolff inequality (Proposition 4.5). We can actually 
show that estimate (4.9) continues to hold even when the restriction (4.8) on the support 
of the ipR is removed. This will be done in Subsection 4.6. 

Remark 4.2. We also detail a one parameter version of Proposition 4.1, whose proof 
follows the same lines. Let TZ be a collection of rectangles as above, with fixed eccentricity 
ij4 = ecc. Let {vIjr}r£ti be wave packets adapted to 71, and with supp ipR C R. Then 

(4.11) |{x e sh(ft) : \B n (x)\ > X}\ < Cexp ( - c-A_) \sh(K)\. 

4.4. Proof of Proposition 4.1. For simplicity, say X = J = T>. Proposition 4.1 relies 
upon two results. First, we establish an inequality similar to (4.9) for the related square 
function 

(4.12) S Mi)=(e¥ 1 ^))'- n ' cn - 

Lemma 4.3. We have the estimate 

(4.13) \\SB n \\ p <Cpsize(B)\sh{K)\K 2<p<oo. 

Second, we obtain a sharp (with respect to p) norm comparison between B^ and SB^. 
Lemma 4.4. We have the estimate 

(4.14) ||B TC || p <0||SB^|| p , 2<p<oo. 
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Thus, (4.9) is obtained by applying Lemmata 4.3 and 4.4. Then, (4.10) follows imme- 
diately from (4.9) by extrapolation. 

The proofs of Lemma 4.3 and Lemma 4.4 are given at the end of this section. Lemma 
4.4 will be obtained as a consequence of the product Chang- Wilson- Wolff inequality of 
[27], which we detail in Proposition 4.5 in the form of sharp (with respect to p, as p — > oo) 
LP bounds between / and its product martingale square function, following the approach 
of [10]. We recall some notation. For I G T>, hj denotes the L? normalized Haar wavelet 
on I. For a rectangle Q = I x J, Hq stands for the tensor product wavelet hi ®hj. We 
introduce the one and two-parameter martingale square functions 

i i 

(4.15) (AJ)(*0=(£|</^>| 2 ^)\ (A 12 )/(x)( £ \(f,h Q )\ 2 ^-Y. 

\ lev ' ' ' ^QeVxV '^' ' 

Proposition 4.5. We have the sharp inequality 

(4.16) ll/llp < Cp||A 12 /|| p , 2<p<oo. 
The proof of Proposition 4.5 is outlined in Subsection 4.5. 

Proof of Lemma 4-3. Let p > 2 and q — |. It then suffices to show that 

(4.17) ||(SB^) 2 || L , (shW) < Cg||M^||^ g , i00 size( J B) 2 |sh(^)|i, 1 < q < oo. 

Indeed, the bound ||Mft|| 9 '_>. g / )00 < Cq follows immediately via weak-type interpolation of 
the trivial L°° bound with the endpoint for the strong maximal function on IR 2 [JMZ] 

(4.18) | {x G sh(ft) : M n f(x) > \}\ < C [ ^^ log (e + ^ 

Jsh(fl) A \ A 

For simplicity, denote K q := ||Mft|| 9 '_>y i00 . To prove (4.17), we assume the following 
claim. 

Claim. For each TZ' c TZ there exists TZ" c TZ' with 

(4.19) |sh(ft")| < l^^l, ||(SB^) 2 || 9 < 27J^size( J B) 2 |sh(^')|" + ||(SB^) 2 || g - 

With the claim in hand, we can easily complete the proof. Apply the claim to TZ' = TZ; 
this yields an TZ" := TZ X with sh(fti) < 2~ 1 sh(ft). Apply the claim with TZ' = TZ X . This 
yields an TZ" = TZ 2 with with sh(TZi) < 2~ 2 sh(TZ). Iterating, one finally gets 

(4.20) ||(SBtt/) 2 |L < CK q size(B) 2 \ V 2~l |sh(ft)|i < CK \ size(ff) 2 |sh(^)|« 

fo 1-2" 

_ i . 

and the proposition follows from the fact that 1 — 2 i > cq~ . 

The claim is proven using duality. By scaling, we can assume size(B) = 1, which simplifies 
the notation. We choose g G L q , \\g\\ q > = 1, so that ||(SB7?/) 2 || g = ((SB-jz 1 ) 2 , g) , and set 

TZ" = {ReTZ' : E K g > 27 K q \sh(TZ')\~7 } . 
By the bound on M^, we have that 

|sh(7T)| < -\sh(TZ')\ and Vi? G TZ'\TZ" , E n g < 27 K\sh(TZ')\~7 . 
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Therefore 

||(SB 7e 2 || g = ((SB^) 2 ^) 

= ( J2 \b R \ 2 ^Tig) + \\{SB nf ,) 2 \\ q \\g\\ ql 

R£TZ'\R" 

<( su P/ E^)( Yl |6«l 2 ) + ll(SB^) a )||, 

R.eK'\K" R£K'\K" 

<L7A^|sh(^)r^ + ||(SB^) 2 ||„ 
and this completes the proof of the claim, and in turn, of Lemma 4.3. □ 

Proof of Lemma 4-4- The lemma is an immediate consequence of Proposition 4.5 applied 
to / = B^, and of the pointwise bound 

(4.21) (A 12 B^)(x) < C7SB^(x), x e sh(ft). 
Let us prove (4.21), by computing A 12 B^ explicitly. Observe that 

(4.22) Q(/LR => (il> R ,h Q ) = 0. 

Indeed, if Q = Iq x Jq is not contained in R = Ir x Jr, and QnR ^ 0, we have that either 
Ir £ Iq-i or Jr £ Jq ( or both). Then (4.22) is a consequence of ijjr having mean zero 
(see (4.5)), and Hq being constant, along each horizontal (resp. vertical) line. Moreover, 
again as a consequence of (4.5) and of the smoothness of ipn, 

\\R\y ' 



(4.23) 


QcR => \(i>R,h Q )\<\ 


Therefore 




[(A 12 B*)(x)] 2 = J2 i 
QeVxV 




£ E 1 

QeVxV 


' v 6 a! q !Vi q (x) 

y R&Z:RDQ V rM 1 '/ 


s E 1 

QeVxV 


. Z^ |_R| \R\ [ Z^ 
y ReK:RDQ 1111/ \Ren-.R2C 



lo(x) 



< 



i?,e7£ ' ' ^fc,£>0 QCR / R&l ' ' 

|/Ql=2- fc |/fl| 

which is exactly (4.21). □ 

4.5. Proof of Proposition 4.5. We follow the approach of [10]. The bound (4.16) is 
obtained as a consequence of the vector-valued inequality for the one parameter square 
function 

(4.24) ||||Q?/)lh/)|| LP <C7^||||A 2 ( ?/ ||, 2(/) || LP , 2<p<oo. 
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Consider the £ 2 -valued function Fj(xi,x 2 ) = (/(■, x 2 ), hj)hi(xi). Then A 12 / = ||A 2 Fj||#2(j), 
so that 



|A 12 /|| p =||||(A 2 iJ»|| fl(J) | 



I A 2 F 7 (xi,-) \\{2(n II p 



/ p 



~l„-7 



>C~ l p 



\Fi(x u 



2 WIIl?„ 



/ P 

■^r-i 



C-V^ 



|*>(zi, 



U 2 (/)| 



/ p 



C"V* HAi/C-.xa)!^ ^C-V 1 ||/(-,* 2 



Ll 2 



ILL 



/ p 



c-V 1 ! 



ip- 



We used the vector- valued bound (4.24) to get to the second line, and the scalar valued 
bound to obtain the last inequality. We have thus proved Proposition 4.5, up to showing 
that (4.24) holds. This is done by applying the Rubio De Francia trick to the weighted 
inequality 



(4.25) 



\\fk\\ 2 i2 (k) w(x 2 )dx2<C[w} Al / \\A 2 fk\\J2 (k) w(x 2 )dx 



>(*)' 



which is obviously equivalent to the scalar case. In turn, the scalar case of (4.25) follows 
from the Chang- Wilson- Wolff good-7 inequality [3] 

|{x 2 :sup|E 7 /| >2A,A 2 /(x 2 ) < 7 A}| <Ce~^\{x 2 : sup |E,/| >A}|. 

X2&I X2&I 

One chooses -\ = c[w]a 1 arid recalls the following property of Ai weights: there exists 
c > such that 

EG I, \E\ < e~ c[w] -^\I\ =► w(E) < §w(Q). 

This yields a weighted good-lambda inequality whose integration gives (the scalar case 
of) (4.25). See [10] for details. 

4.6. Dropping the compact support assumption. Let TZ be as above, and now let 
{iPr}r£-r be product wave packets L 2 - adapted to 1Z for which we no longer assume (4.8). 
Let us prove that (4.9) continues to hold. The main tool is the following lemma, which is 

a variant of [24, Lemma 3.1]. 

Lemma 4.6. Let ipR be a function L 2 adapted to the rectangle R and having mean zero 
along each line parallel to the coordinate axes of R, as in (4.5). Then 



(4.26) 



ip R (x 1 ,x 2 ) = J2 2 100 VhW (si,**), 

fc>0 



whre for each k > 0, {V'.rOo} is a collection of product wave packets adapted to {R^ : 
R E TZ}, with supp ip R (k) C RP°\ The adaptation constants depend only on the adaptation 
constants of ijjr (in particular, they do not depend on k). 



We apply Lemma 4.6 to each ipR, so that 

(4.27) B n (x) = J2 «*(*) = ^V 100 * £ b p ^ 



x 



Ren 



k>0 



pe^( fc ) 



£2 

fc>0 



-100fcp( fc ) 



TlW '"*''■ 



where K^ := {p = R.W : R E K} and we defined B® = {b p := b R : p = R^ E K^}. 
Then, it is easy to see that size(£>( fc )) < size(S). As discussed in Subsection 4.1, TZ^ k ' can 
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be split into Ck 2 2 2k subcollections of rectangles whose horizontal and vertical sides fit 
into a single dyadic grid. Thus, applying (4.9) for each of these subsets, 

||B^ ) || p <C7A; 2 2 2 ysize(i?W)|sh(^)|i. 

Finally, using (4.27), we conclude that 

(4.28) llEfcll, < J2 2 ' W0k \K(4 P < ^ 2 (^2- 90fc size(5( fe ))|sh(^ fc ))|^) 

fc>0 

< Cp 2 size(B)\sh(R)\ 



as claimed. 

5. The time-frequency phase plane 
We briefly recall (see [23]) how to produce a discretization of the multiplier operator 

where a = (0, a) is a subinterval of (0, 1). 

Following [23], we start by decomposing the frequency plane into a union of annular 
sectors. For ann £ 2 Z and u £ V, u £ [0, 1] define 

^W = {£ = leie 2 ^ : |f | £ [f ann, fann), 9 £ w}. 
and 

^ann,aj \-K X ^'ann,w • ^ fc '^-ann,d<jj- 

Here 7£ a nn,w denotes all the rectangles with dimensions ann -1 and |u;| _1 |ann| _1 , which are 
obtained by rotating with angle c(u) the rectangles with the same dimensions from DxP. 
The elements in the collection 

S - I I S 

cj,ann 

will be referred to as tiles. We will typically write a tile s as s = R s xQ s , and also denote 
by ann(s), w(s) the corresponding components. Also, wi s ,u;2s will be the left and right 
dyadic children of u s , while f2 ls , f2 2s will denote the corresponding sectors. The number 
ecc(s) := \u s \ will be referred to as the eccentricity of s. 

Let $ be a positive Schwartz function supported on [|, |] with 

V$ =1 

/ j ^ann ■»•) 

ann 

where $ ann (£) = $(ann _1 t). 

The next step is the partition of unity for l a described in Chapter 7 of [31]. Following 
this approach, the analysis of G a f is reduced to that of operators of the form 

E /(OA,(0(0)W«)- 

w(Z[0,l\:weG 
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Here Q is a fixed dyadic grid, independent of a. For all practical purposes, we can assume 
Q is in fact the standard dyadic grid T>. We will denote by oj±, 0J2 the left and right dyadic 
children of u G T>. Also, each /3 W is a smooth bump function supported on uj\ such that 

(5.1) \\ffl\U < CM'* > j>°- 

Finally, note that the multiplier 

m ann , w (£)=/U0(£))$ann(|£l) 

is supported in and L°° adapted to the rectangle-like region n annA)1 . By applying a stan- 
dard windowed Fourier series we can write 

(5.2) /(£Knn,o,(£) = Yl (/^ S >4(£), 

where for each s = R s x Q s G S annA ,, (p s and ip s are Schwartz functions supported in 
frequency in fl anntUlla and L 2 - adapted to the rectangle R s . 

By putting all these things together, it follows that we can write G a f(x) as a combi- 
nation of model sums of the form 

(5-3) ^(f^ s )i) s {x)l L02s {a)- 

ses u 

By splitting the model sum in two parts, we can further assume that ann(s) G 4 Z for 
each s G S u . In particular, if ann(s) 7^ ann(s') then (ip s ,ip s >) = (ip s ,tp s i) = 

For the purpose of future applications of the theory in Section 4, we remark that ip s 
and ip s are in fact product wave packets adapted to R s . This follows since the canonical 
directions e 2nlc ^ 3 \ e l % + 2nlc ( u >s) of R s do not intersect the frequency support f2 ls . The same 
argument shows that ip s and ip s continue to have the same property if R s is slightly rotated 
so that the canonical directions become v,v x , for any v G U2 S - 

Finally, we mention that the discretization (5.3) will serve us well when we treat the 
operator Hv in the following sections. The fact that we have a sharp cutoff l W23 ( a ) 
in the model sum will be absolutely crucial in our argument. We do not know if this 
decomposition can also be achieved for general kernels K (that is for the operators T-y). 
All earlier papers on the vector field problem used model sums involving smooth cutoff's 
Puj 2s (a) (/3 smooth), which can in turn be realized as building blocks for all Ty. The 
fact that our argument needs rough cutoffs is due to our use of product trees. In short, 
any smooth model for a product tree will produce error terms that are too large to be 
considered negligible. 



6. Model sums, trees, and an L p almost orthogonality principle 

6.1. Trees, size, and an almost orthogonality principle. Let S denote a fixed ar- 
bitrary finite subset of S u . We call trees those subsets of S u which point roughly in the 
same direction. 
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6.1.1. Trees and size. A subset T C S u is a lacunary tree if 

(6.1) U 2 T = f| UJ2s ^ 0. 

seT 
Thus, for a lacunary tree T, the Heisenberg boxes R s x f2 ls of ip s are pairwise disjoint, 
and it is easy to see that we have the almost orthogonality relation 

(6.2) Ek/,^)^ 11/112. 

set 
A subset T C S u is an overlapping tree if 

(6.3) u 1T = p| u 1s ^ 0. 

For an overlapping tree T, the directional support uncertainty intervals {u2 S '. s G T} are 
pairwise disjoint. 

A subset t C S is a conical tree if u^s = u)2 S > —'■ k>t> f° r a U s ) s ' £ t. A conical tree is 
both lacunary and overlapping. Any v T G Co>2t (resp. i>t G uj 1t , v t G w t ) is a top direction 
of the lacunary (resp. overlapping, conical) tree T (resp. t). If T is a lacunary tree and 
s G T, then 



(6.4) 



/ </?s = / (/) s = 0, Vx G 

J{a-+to T :ieM} J{x+to^:teM} 



)2 



The shadow and k- shadow of a tree are respectively defined as 

sh(T) = U{i? s : s G T}, sh fc (T) = U{R^ : s G T}, 

(k) h 

where R s = 2 R s . For future use, we also introduce the notation 

(6.5) crown(T) = (J w 2s C (0, 1) ~ S 1 

seT 

for the crown of a tree T. 

Let / be a given function. The lacunary size of S' C S (with respect to /, whose 
dependence in the notation is suppressed) is given by 

(6.6) size(S<) = sup (^£|</,^) 

lacunary tree 

where (p s are product wave packets adapted to S. The conical size of S' C S is defined as 

(6.7) sizev(S') = sup (t^t^j ^ l(/> ^)| 2 )- 

I V / I oCf 

conical tree 

Since conical trees are in particular lacunary trees, we always have sizev(S') < size(S'). 
We recall for future use [21, Lemma 4.40] 

(6-8) (kh?W^ l(/ '^ >|2 ) 1/2 ~ ll/IU ' 

for each lacunary T. 
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6.1.2. Single tree operators. Let T be a lacunary tree. Let {^p s ,' l f J s}s£T, and for some 
k > 0, {ips }seT, be product wave packets adapted to {-R s } s6 t (resp. {R s Jset)- Assume 
in addition that supp ip s C Hi . Define the model sums 

(6.9) f T = J2(f^s)*Ps, 

sGT 

and 

(6-10) /i fc) = E^^)^ fc) - 

Lemma 6.1. We have the estimate 

(6.11) H/tIIp < Cp 2 size(T), 2<p<oo. 
Lemma 6.2. We have the estimate 

(6.12) |{ xGshfe (T):|/i fc) (x)|>A}|<Cexp(-cyQ^)|sh fe (T)|. 

Proofs of Lemmata 6.1 and 6.2. Estimates (6.11) and (6.12) follow respectively from (4.28) 
and (4.10). Let us explain the details for (6.11). Note that for each s £ S, v T £ U2 S , which 
is within the uncertainty angle of R s . Therefore, property (4.5) still holds if we replace en g 
by vt- Then, we can find a rectangle p s oriented along t> T such that R s C p s C 90R S and 
{p s : s £ T} are products of intervals coming from finitely many dyadic grids. Thus one 
observes that setting b Ps := (f,f s ), gives / T = B{ Ps . seT }, and {ip S) ip s } are still product 
wave packets adapted to {p s : s £ T}. Therefore (4.28) applies. □ 

An application of (4.11) yields a similar result for conical trees. Let t be a conical tree. 
Let {(p s } set , and for some k > 0, {ips /set, be wave packets adapted to {-R s } s6t (resp. 
{Ri k) } se t). Then 

(6.13) \{x £ sh*(t) : |/ t (fc) (^)l > A} | < Cexp ( - C _A_) |sh fe (t)|. 

6.1.3. An LP almost orthogonality principle and an application to H V - We prove the 
following LP almost orthogonality result. 

Theorem 4. Let a be a collection of disjoint intervals. We have the estimate 

(6.i4) (Ell^/llp)'^!!^ v ^ 2 - 

Proof. By restricted-type interpolation, it suffices to show that (6.14) holds for |/| < 1#, 
where E C M 2 has finite measure. We want to use a model sum similar to the one in 
(5.3). However, we use a more straightforward partition of unity for a = (an,a r ), namely 

J-a / Pui- 

UieG'a 

Here C a consists of all intervals u £ T> U V 1 U T> 2 such that oo C a and dist(a;, {«/, a r }) ~ 
|o;|, while /3 U is supported in and L°°- adapted to uj. Since || Ylu&c l^lloo ^ 1, by working 
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with a finite number of subsets we can assume the intervals in C a are pairwise disjoint, 
dist(u,ai) ~ \u\ and prove 

(6-15) (EH/twI5)'^pII/IIp, Vp>2. 

aeOL 

Here 

T(a) = {seS u :cu s e C a }, 

/T(a) = J^ ^' V 9 *)^' 

w s eT a 

and for each s = R s x f2 s , ip s ,ip s are product wave packets adapted to _R S and supported 
in frequency in Q s . The axes of R s are oriented along e 27Tiai , e 1 2 +27Tiai 

Note that T(a) is similar in nature to a lacunary tree, in particular (6.2), (6.8) and 
(6.11) will also hold for it. We chop each T(a) into subsets T CT (a) by means of the 
following standard iterative procedure. 

• INIT Stock := T(a), a ~ 1; 

• WHILE Stock 7^ 0, select the largest collection T a (a) C Stock with 

£ |(/,^)| 2 >^|sh(T CT (a))|. 

seT CT (a) 

Set Stock := Stock\To-(a); a := |. 
This iterative process produces the decomposition 

(6.16) T(a) = |J T ff (a), size(T CT («)) < a, 

and the almost-orthogonality (6.2) of the {tp s : s G T(a)} yields 



(6.17) |sh(T CT («))|<^ 2 £ |(/,^)| 2 <^ 2 ||C7 Q /|| 2 . 

seT CT (a) 

Therefore 

( v^ \\t n p V IIh f ii llll v / II 

{l^WjTWWp) =\\Wh( a )\\tP(OL) p =\\\\l^JT^)\\ \\ p 



<E(EII/T. w lO'^EKEi sh ( T -' a ))i)' 

cr<l aeCK cr<l aeOL 



r<l a£Q 



In the above display, we used the single tree estimate (6.11) of Lemma 6.1 to get the 
second inequality of the second line, (6.17) in going from the second to the third line, and 
disjointness of a G a to conclude. This proves (6.15), and in turn, Theorem 4. □ 
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We can use the L p -orthogonality principle of Theorem 4 to give a simple proof of the 
bound 

||#v/|| P < P logiV||/|| p , 2<p<oo. 

The proof is done by induction on n, where N = 2 n . For the inductive step, assume 
that whenever #V < 2™" 1 , ||.ffv'||p->p < C p (n — 1), where C p is bigger than the implicit 
constant appearing in Theorem 4. Order V increasingly and let V = {i>i G V : i odd}. 
For % odd, let w, be the frequency cone {£ : |4 G | + [1^,1^4.2)}. For each i odd, 

(6.18) t; G RW => |^/(x)| < \H Vi f(x)\ + |^ !+1 [G^/](a;)|, 

and taking suprema and L p norms, 

||^v/||p<ll^v/|| P + sup H Vi+1 [G Ui f] 

i odd 



<C p (n -l)||/|| p + (X) ll^/lg) ? 



— ^p^||J lip; 

which completes the induction. The last line is an application of Theorem 4, while, in 
going from the first to the second line, we have used the induction hypothesis and the 
fact that only one i contributes at each point. 

6.2. The model sum for H\-. We recall that, for a direction v, the directional multiplier 
H v is given by H v f(£) = /(£)sign(i> • £), which is essentially the same as 

where u = exp (27rir(i>) — y). Then, we use (5.3) to define the model sum for H v 

(6-19) Hs/fov) = ^(/,^)^(x)l W23 (i;), 

ses 

where S is any finite subcollection of S u , and we identify t(v) with u. Accordingly, the 
model operator for the maximal multiplier H\- is defined as 

(6.20) HS/(x) = sup|H s /(x,i;)|. 

uev 

Finally, we write H s as a rapidly decaying sum of pieces supported in the dilates sh fc (S), 

k > 0. Applying Lemma 4.6 to each ip s , we write ips — Sfc>o 2~ 100fc V4 > wnere ip s (h) are 

product wave packets adapted to the rectangles {R s : = 2 k R s }, with the mean zero 
property 

(6.21) v G u 2s =► / Vi fc) = / ^ = 0, Vx G M 2 . 



We thus have 

Hs/fot;) = J]2- 100fe ^a^ s )^f(x)l„ ew2a 



fc>o ses 
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Defining 

Hgw/W = sup I V(/,^ s )^ fc) (x)l^ 2s 

we have the pointwise inequality 

Hf(x)\<J2^ 100k Kkf( x )\- 

In particular, we reduced Theorem 3 to the proof of the weak- type bound 
(6.22) UHSw/lkoo <q 2 4fe v4oiiV(loglogiV) 6 . 

7. Vargas type V: Proof of Theorem 3 

Before entering the proof, we begin the section with a more detailed description of 
Vargas sets of directions, and provide some important examples. 

7.1. Vargas sets. We recall Definition 1.3. We say that a set of N directions is a Vargas 
set with constant Q if the longest lacunary sequence (see Definition 1.2) in V has QlogN 
elements. Our definition is easily seen to be equivalent to the one of [18], where the 
terminology has been introduced. For example, uniformly distributed sets 

y N = {v j = ^ F ,j = o,...,N-i} 

are of (uniform in N) Vargas sets with constant 1. Another prime example of (uniformly) 
Vargas sets are finite truncations of Cantor-type sets. 

Claim. The truncated g-adic (q > 3) Cantor sets of N = 2 n elements 

Vat := £ q ,n = < y^Q 3 g~ J , {a 1 ,...,a n ) E {0,q- l} n >, 
< j= i > 

are Vargas sets with constant 4 (independent of q and N). 

Proof. We argue by induction on n. The case n — 1 is trivial. The inductive assumption 
is that for each v < n, a lacunary sequence contained in (£ gi „ has at most 4z/ elements. 
Take a lacunary sequence 

n 

i=i 

Let h be the first integer such that v[ ^ v 2 . Then 

\k- < \vi -v 2 \ < —■ 

As a consequence of the lacunarity property (1.9), it is easy to see that 

\vt-v m \ < 2^2^2-^11, Vm > 2, £ = m,...,r. 

Hence, for each £ = 5, . . . , r, we must have v/ = v§ , for each j = 1, . . . , h. If not, there 
would be some j < h,£ > 5 such that 

V U) J. JJ) _. 2^1 < L. _ .. I < 31 I < 3^_ 
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which gives the contradiction q — 1 < ^q. This shows that {v 5 , . . . ,v r } is a lacunary 
sequence contained in 

J2 «°V, « (1) = vP, . . . , a™ = t;?\ (a {h+1 \ ■ ■ ■ , « W ) G {0, g - 1}^} ~ <£,,„_*, 



so that r — A < 4(n — /i) by the inductive assumption. We get r < An — Ah + 4 < 4n, 
which completes the inductive step. □ 

The claim below shows how Vargas sets and lacunary sets are, in a sense, extremes. 
Claim. Let #V = N. Then V contains a lacunary sequence of -^|— elements. 

Proof. This is best seen by induction on n = -^f— ■ Take as base case n — A. The inductive 
assumption is that every set of at least 2 3t/ directions contains a lacunary sequence of v 
elements, for 4 < v < n. Let V = {v\ < v<i < . . . f 2 3(n+i) } be given. We can rescale 
and assume that V\ = 0, w 2 3 (™+ 1 ) = 1- Define Vj = Vfl [^g-,^-], j = 1,...,8. By 
pigeonholing, at least one Vj has at least 2 3n elements: assume it is so for V 2 (the other 
cases are treated similarly). Then, by inductive assumption, V 2 contains a lacunary 
sequence {u>i, . . . , w n } C [|, |] with node w. Moreover, by repeated application of (1.9), 
dist(w, V 2 ) < 2~( n - 1 ) so that 

(7.1) \w-wx\ <diam(V 2 ) + dist(w,V 2 ) < | + 2^( n ~ 1 ' < \. 

We set Wo := 1^30+1) = 1. Then w G [0, |], so that \w — w n+ i\ > |. Comparing this to 
(7.1), we observe that {w , w±, . . . , w n } is a lacunary sequence of n + 1 elements contained 
in V. The inductive step is complete. □ 

7.2. Preliminary reductions and outline of the proof. Let V be a fixed set of 
directions with ^V = N and Vargas constant Q. Using the notations and the results of 
Section 7, Theorem 3 follows from the weak-type bound 



HWIkoo <Q 2 4fc v / ioiiV(loglog Nf. 



From now on, the dependence on Q of the estimates is kept implicit. By scaling /, it will 
suffice to show that 



■>4fc|| 1 1|2 



(7.2) |{x G M 2 : H* >fc /(x) > 2 2fe v / ioiiV(loglog Nf}\ < 2 4 

We will obtain (7.2) by constructing a set A = A 1 U A 2 C M 2 with the properties 



(7.3) \A\<2 4k \\f\\l H^ ) /(x)<2 2fc v / ioiiV(loglogV) 6 Vz £ A 

This will be done by decomposing S into the disjoint union of forests (i.e. collections of 
trees) S CT , where a is a dyadic parameter ranging in {z G 2 Z : z < 2 K }, which stands for 
the size of the subcollection S CT , as we will detail later. Here, K = size(S). Accordingly, 
using the triangle inequality 

(7-4) H* (fe) /(x)< X>^>/(*)> 

a<2 K 
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and each piece H* (fc) / is supported on shfc(So-). In turn, each S CT will be the union of trees 

So- 

T6 J ff with the properties 

(7-5) £|sh(T)|<<r- 2 ||/|| 2 , 

TeJ- CT 

(7.6) SUp V" lcrown(T)(^) < C log N. 

Property (7.6) will be obtained as a consequence of the Vargas set structure of V. We 
take the first component of the exceptional set A to be 

Al = U U sh *( T )- 

Using (7.5), we have 

\a i \ < Yl Ei sh ^ T )i^ 24fc E ^ 2 n/ii2<2 4fc ii/ii^. 

l<a<2 K t&T a l<a<2 K 

The support of each H* (fc) , 1 < a < 2 K is contained in A 1 , so that 

(7.7) H* s , fc) /(z) < J2 %wf( x ) <EE H TW /(x), x g A 1 . 

ct<1 <r<l Te-F CT 

We now construct the second component of A. We begin by excising an exceptional set 
for each tree T 6 T a appearing in the sum (7.7): setting 

(7.8) A^ c = {xe sh fc (T) : H* w /(x) > 2 2 V|101oga| 2 (loglogiV) 2 }, TeJ ff 
we will show that 

(7.9) |A|J e | <cr 10 2 2fc |sh(T)|. 

Set A 2 ' 81 ™ := UTgj^Arjf^ . Using (7.5), we have the estimate 

(7.10) \A 2 /™\ < J2 \ A tT\ < 2 2 ^ 10 £ l sh (T)| < 2 2 V- 8 ||/|| 2 

T6-F CT TeJ- CT 

We also excise an exceptional set coming from the whole forest S a : 

(7.H) A 2 f ^ := { Y Wr>0»0 £ ^ 2 ! lo S^| 2 }- 

TeJ" CT 

Another application of (7.5) entails 

n -2 2 4fc 

(7 12) {Al ,» n < ^ |shi(T)| < ii/i. 

i fa i TeJ - CT i & i 

Putting all pieces together, we obtain our second component A 2 : 
(7.13) A 2 = (J (A 2 < sizc U A 2 ' count ), |A 2 | < 2 4fc ||/|| 2 . 

the estimate on the measure of A 2 is obtained by summing over a < 1 the estimates in 
(7.10) and (7.12). Thus, the first part of (7.3) holds true. 
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We are left to show the second part of (7.3). Recall that crown (T) = {J{cu = uj 2s '■ s G 
T}. A a consequence of the definition of A 2,connt and of property (7.6), we have that 

(7.14) supsup ^ 1 sh fe (T)(^)lcrown(T)(^) < minllog^cr-^logo-l 2 }. 

Then, for x (jL A, choose v(x) G V which attains the supremum in H* {fc) . We have 

h; ? )/(x) = | J2 E<^>^)W^))| 

= | Yl 1 sh fc (T)(a;)lcrown(T)(^(2 ; ))$^(/ 5 ^)^ fc) (a;) 1 ^)6^ s 

< min{logiV,o-~ 2 |logcr| 2 }( sup H* (fe) /(x)j 

< 2 2fc (log log iV) 2 minjlog Na\ log a\ 2 , a" 1 1 log a| 4 } 
Taking advantage of (7.7), we then get, for x sjL A, 

H5 w /(^)<E H i«^) 

<2 2k (\og\ogN) 2 J2 ff" 1 |loga| 4 + 2 2fc logA^(loglogA^) 2 J^ a \ lo S^\ 

v / IogA r >o- 1 >l a<y/\ogN 

<2 2k ^/hg~N(\og\ogN) 6 , 

which completes the proof of the second part of (7.3). This in turn proves the weak- type 
bound (7.2), and finally, Theorem 3. In the remainder of the section, we will show how to 
achieve a decomposition S = U CT < 2 if S a complying with properties (7.5), (7.6), and whose 
trees satisfy the exceptional set estimate (7.9). 

7.3. Lacunary and overlapping parts of S. First of all, it suffices to assume that 
#(Vflcj2s) > 1 for each s G S. Indeed, tiles with empty VflW2 S do not contribute to the 
model sum H* (fc) . We decompose 

S = LUO, L = {sGS: Vnw ls = 0}, = {sG S: Vnw ls ^0} 

Then, S a will be obtained as the union L CT UO CT . The forest L CT will be the union of lacunary 
trees T G J 7 ^, while the forest O^ will be the union of overlapping trees T G J 7 ®. For 
* G {L, O}, and a < 2 K , we will achieve the properties 



(7.15) Yl l sh (T)|<^ 

(7.16) SUp Y 1 crown(T)(^) < C log N, 



vev 



(7.17) TGj;^ |{x:H^ (fc) /(x)>2 2 V|101og ( 7| 2 (loglogiV) 2 }|< ( 7 10 2 2fc |sh(T)|. 

This will imply the corresponding properties for S CT , and in turn, complete the proof of 
Theorem 3. 
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7.4. The lacunary part L. We begin by noting that the intervals {u = Uu : s G L} 
are pairwise disjoint. Indeed, if we had U\ s strictly contained in Wiv, for s,s' G L, then 
u)2s C uw too. This would force #(V fl ujw) > 1, which we ruled out. As a consequence, 
all trees T C L are of lacunary type, and the almost-orthogonality relation 



(7-18) $^K/,¥Vi 



holds. We organize L into subcollections L(j) by setting 

L(j) = {s G L : V < co 2s < 2^ +1 }, j = 1, . . . , log JV. 

Recall that 7r s = wi s U u^s- For each L(j), we distinguish the minimal (with respect to 
inclusion) dyadic intervals {vrj} inside {tt = tc s : s G L(j)}, and form the lacunary trees 

T(j,£) = {s G L(j) : u 2s D TT?'}, L(j) = U/TO',4 

It is clear that each 7r s contains exactly one Tij. This forces 7r s Pi 7r s / =0 whenever 
s G T(j,£),s' G T(j, £'),£' j£ L Thus for each j the sets {crown (T(j,f)) : £} are pairwise 
disjoint. We conclude that 



(7.19) SUp V" 52 lcrown(T(j,f))(^) < bg N. 

We now construct J 7 ^ = U °? x U£T CT (j, £) by chopping each tree T(j, £) into subtrees with 
size ( TV (j, £)) < a. The chopping is obtained via the following iterative procedure. 

• INIT Stock := T(j,£), a = 2 K ; 

• WHILE Stock ^ 0, select the largest collection T ff (j, I) C Stock with 

£ |(/,^)| 2 >^|sh(T CT (j,£))|. 

seTV(a) 

Set Stock := Stock\TV(j,£); a := f. 
The almost-orthogonality property (7.18) yields that 

log TV 

(7.20) EEi sh ( T -^ £ ))i^^ 2 EK/'^)i 2 ^^ 2 n/ii2' 

so that (7.15) for * = L is verified. Moreover, (7.16) follows a fortiori from (7.19) All 
that is left to verify is the single tree estimate (7.17). Let T = T(j,£) G J 7 ^ (j and 
£ have no particular relevance). For each v G V fl crown (T), let us isolate the subtree 
of T contributing at v, that is T(v) = {s G T : v G tu 2s }. Set T(T) = (T(w) : v G 
V n crown(T)}. With this position, using the notation of (6.10), we have 

Htw/W = sup \ff){x)\. 

T'eT(T) 

We claim that, as a consequence of the Vargas set property, T(T) contains at most 
ClogiV distinct subtrees T'. The proof of the claim is postponed at the end of the 
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section. Assuming for now that #T(T) < ClogiV, we can complete the proof of the 
single tree estimate as follows: 

(7.21) \{x : H* (fc) /(o;) > 2 2 V|101oga| 2 (loglogA0 2 }| 

< E |{^:|/i' ) (^)l>2 2 V|101oga| 2 (loglogiV) 2 }| 

T'eT(T) 






2 2fc o-|logo-| 2 (loglogA r ) s 



T'er(T) 

<2 2 V° Yl e- loglogJV |sh fc (T')| < 2 4 V°|sh(T)|. 
T'eT(T) 

We used Lemma 6.2, and the fact that size(T') < a . Thus, the proof of (7.17), for * = L 
is complete. 

Proof of the claim. Ordering Vflcrown(T) = {v±, v 2 , . . .} counterclockwise, we say that Vi 
is a cutoff direction if T(vj) D T(t»j_i). We show that the sequence {v t : v^ is a cutoff} is 
lacunary with a top direction of T, t>x, as its node, and thus it can have at most Clog N 
elements, which proves the claim. By pigeonholing, it suffices to prove this under the 
additional sparseness assumption 

S,s' E T, U 2s C U 2s i =^ 8|W2 S | < |W2 S '|. 

This implies that |crown(T(t>j m ))| > 8|crown(T(t; im+1 ))| for two consecutive cutoffs Vi m 
and v im+1 with i m < i m+1 . Thus 

\v im -v T \ > -|crown(T(^ m ))| > 4|crown(T(t; iro+1 ))| > 2\v im+1 - v T \, 

and this shows that the cutoffs are lacunary with node vt- The claim is proved. □ 

7.5. The overlapping part O. We start with the construction of J 7 ®. The trees of T® 
will be overlapping trees obtained as saturation of conical trees (see the definitions in 
Section 6) with conical size roughly a. Let us give a precise definition of saturation. 

Let t be a conical tree. Note that {R s : s G t} is a collection of dyadic rectangles with 
the same shape, coming from a single dyadic grid. Thus, the tiles t max with R s maximal 
are well defined and sh(t) is the disjoint union of {R s : s £ t max }. The saturation T(t) of 
a conical tree is defined as 

T(t) = {s' £ S : u± s i D u>t, R s ' C 10R S for some s £ t max }. 

We omit the dependence on t on the notation when no confusion arises. Clearly, T is an 
overlapping tree with top u t and |sh(T)| < 10|sh(t)|. 

The next lemma, along the lines of (for example) [31, Lemma 8.2] or [25, Lemma 6.6] 
will be used in the iterative construction of T® . We sketch a proof at the end of the 
subsection. 

Lemma 7.1 (Size Lemma). Let O' G O be a collection of tiles with sizev(O') = a. Then 
O' := 0' small U 0' big where 

(7-22) size v (0^ mall ) < °- 
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and 0' bi = Uxe^T. Each T G J 7 is a saturation of a conical tree t, and 



(7.23) J2 l sh ( T 



<^~ 2 ll/ll2; 



Starting with O' = O (which has sizey(O) = 2 K ) and iteratively applying Lemma 7.1, 
we obtain the decomposition O = U ct < 2 a-O ct , with 

(7.24) size(0 CT ) < a, 

(7.25) O a — M T, and each T G T a is the saturation of a conical tree, 

(7.26) #(w T := {cu : cu = u 2s for some s G r(t)}) < ClogiV, 

(7.27) |sh(T)|<a- 2 ||/||i 

The only property which is not straightforward from the lemma is (7.26). This is an easy 
consequence of the Vargas set property: the intervals of u> T are pairwise disjoint, and 
each contains at least one v G V. Denote with vi any of the directions of V contained 
in u G o; T , \u>\ = 2~ e . Then 2~^~ 1 < \v£ — v^l < 2~^, which means {ve} is a lacunary 
sequence with node vt- Thus, {ve} (and in turn ojt) contains at most ClogiV elements. 
We now reshuffle the trees in T® to make sure that 

(7.28) SUp J2 icrownCT)^) < C log N 

v€S1 T£-F° 

also holds true. For each dyadic interval cu, consider the collection of tiles s G J-~ with 
u 2s = oj. These tiles are partially ordered by the relation s <^C s' if R s C R' s . Then remove 
each tile s which is not maximal with respect to this order relation from the tree T to 
which s belongs, and assign s to the tree T' where the maximal tile s' s.t. s<«' belongs. 
We keep calling the resulting forest JF~ . Since this reshuffling does not modify O a and 
since it only makes u>t and sh(T) smaller, properties (7.24)-(7.27) still hold. 

As a result of the reshuffling, all the tiles s G O a with U2 S = w will belong to the same 
tree T, and (7.28) follows easily. Suppose that the sum in (7.28) is equal to r > 0. Then 
v belongs to r (nested) dyadic intervals uj = {u} and for each of them we have u = uo 2 
for s in some (uniquely determined) tree T. Then, the collection {ui s : uj 2s = u, u G a;} 
is pairwise disjoint and each ui s contains at least one element of V. The same argument 
used to show (7.26) implies that r < ClogiV. 

Having at our disposal (7.23) and (7.28), we have established (7.15) and (7.16) for 
* = O. We are left with proving the single tree estimate (7.17) for an overlapping tree 
T G jFp. We observe that T is a disjoint union of (at most CTogiV) conical trees, 

T = U wea ; T t(w), t(w) := {s G T : u 2s = to}. 

The intervals in o; T are pairwise disjoint, so that, with the notation of (6.13), 

W TW f(x)= sup \f^(x)\. 

Then, the estimate 

|{^:H^ ) /(a:)>2 2fc a|101og ( 7|loglogiV}|<a 10 2 2fc |sh(T)|, 



,s 
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which is even stronger than what we need for (7.17), is obtained in the same way as (7.21), 
applying the estimate for conical trees (6.13) in place of (6.12). This concludes the proof 
of the theorem. 

Proof of Lemma 7. 1 . We insist only on the points which are different from the classical 
size lemma in [22]; see [31] and [25] for more details. 

We can assume that the uncertainty intervals {uj\ s : s G O'} come from a single 2-sparse 
dyadic grid. The following iterative procedure is used to separate Og mall from 0' big : 

• INIT T + , T' = 0, Stock := O', O big := 0; 

• WHILE size v (Stock:) > § 

■ select a conical t with 2_. \{f\Vs)\ 2 > ^l s h(t)|, with u T minimal, and sh(T) 

set 
maximal (with respect to inclusion); 

■ UPDATE JF+ = 7+ U t, 7 = F U T(t), Stock := Stock - T(t). 

When the algorithm stops, set Og mall := Stock, O big = Uxej-T, 0' + = U te j- + t. The 
iteration procedure ensures that sizev(0' small ) < |. We are left with proving the inequality 



(7.29) a 2 J>h(t)|< 



2 
2- 

te^j- 



We will use the notation -F(S') := J2 seS i(f, l p s )fs- Standard arguments reduce the proof 
of (7.29) to showing that 

(7-30) ll^(0' + )|| 2 <a 2 ^|sh(t)|. 

For s e t G JF + , define B(s) = {s 1 G 0' + - t : u ls = u ls >}, C(s) = {s' G 0' + - T : u ls C 
Wis'}. The tiles s' G B(s) U C(s) are the ones with (ip s , tp s r) ^ 0. Then 

\\F(0' + )\\l < J2 \\F(t)\\i+ 52(f,<Ps)(<p„F(B(3)))+ 52(f,<p.)(<p.,F(C(8))) 

te-F-f seo' + seo' + 

The fact that first and second summand obey the bound of (7.30) is a simple consequence 
of the almost-orthogonality of the wave packets inside a single conical tree. The third 
summand is controlled by summing over t G T+ the estimate 

J2(f,Vs)(Vs,F(C(s)))< ( r 2 \sh(t)\. 
set 



Clearly this follows if we establish that, for each s G t 



max • 



(7.31) ^(/,^)(^,F(C( S )))< C t 2 | j R s -|. 

s6t(s) 

where t(s) := {s G t : R s C Rs}- Split the collection t(s) into 

t(s) = (J t(s) (n) , t(s)^ := {s G t(s) : 2 n R s C 5R„ 2 n+1 R s <£ 10R S }. 

n>0 



s' • 
max 
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The rectangles {R s : s E t(s)} come from a single dyadic grid in the plane, hence 

/ , \Rs\ ^ 2 n |i?s|, 

s£t(S)( n ) 

so that (7.31) is obtained by summing over s G t(s)^ and n > the estimate 

(7.32) Yl \(f^s)\\( V s'J)\\(Vs,Vs')\<2- Wn a 2 \R s \. 

s'eC(s) 

We now show (7.32), beginning with two observations. 

Obs.1: let s G t and s' G C(s). The 2-sparseness property we assumed implies u) t C U\ 
so that t had been selected before the tree containing s' (by minimality of Ut). If s G t 
is the one tile for which s G t(s), the saturation procedure then forces 8Rg D i? s / = 0. 

Obs.2: by the same token, let st, s' ^ s" with s',s" G" t. Then 

Wit C uj\ s i fl Wi s », ann(s') = ann(s") => i? s / fl i? s " = 0> 

Indeed, the 2-sparseness ensures Wt C Wif fl Wit". In the case Wif = ujw, the conclusion 
follows because R s > and i? s « are dyadic rectangles with the same orientation and side- 
lengths. If not, assume with no loss in generality that City C Wit", then the conclusion 
follows exactly as in Observation 1. 

Therefore the rectangles {R s r : s' G C(s)} are pairwise disjoint (by Observation 2) and 
do not intersect 2 n R s , (by Observation 1), which is strictly contained in 8R S . Since sizey 
controls the size of the single tiles, |(/, ip 8 )\ < a^\R s \, \{f,ip s '}\ ^ &\/\R s '\- Using this 
fact and the rapid decay of the wave packets, (7.32) is a consequence of 

V I/? ,f u M«)r 100 < / / dist(x, i^Q x -loo 10n 

This concludes the proof of (7.32), and, in turn, of Lemma 7.1 □ 

8. Final remarks 

Let / : M 2 — > C. The main unresolved issue in the paper is whether the model sum 
operator 

MS/(x) = sup | V (f,<f s )ip s (x)l U2a (v)\ 
^ 6V .it 
has the L 2 — > L 2 '°° norm of order 0(\/\og N (\og\og N)°^) , for arbitrary VcS 1 with N 
elements. Here, as before, <p s ,ip s are product wave packets adapted to R s . 
Consider the related operators 

1/2 



S/(x)=SUp(^| Y {f^s)^s{x)l u , 2s {v)\ 2 \ 

x arm s6S u :ann(s)=ann 

«ev\~ \K a \ J 
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sc/(x) = sup sup ( Yl ia^>r^rww) ■ 

Note the striking simplicity of SC. For each x, the contribution comes from tiles with a 
fixed eccentricity (i.e. from a single frequency cone). 

By invoking a simple randomization argument, it easily follows that 

||SC|| L 2^ L 2,oo < ||SQ|| L 2^ L 2,^ < ||S|| L 2^ L 2,oo < II MSII Z,2_> L 2,cx, . 



The fact that ||S||l2_> L 2 < \/log N was proved in [6], and this is optimal for generic 
V. However, we strongly suspect that ||S|| L 2_> L 2,oo < (log log N)°^\ If one proves this, 
then the conjectured bound || MS|| Z/ 2_^ jL 2, OQ < A/log A?" (log log N)°^ follows via the Chang- 
Wilson- Wolff inequality, as described in Section 3. The methods in this paper seem to be 
unable to conclude even the bound ||SC|| L 2^ L 2 l0 o < (log log N) ^. 
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